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Abstract 

We present a family of additive quantum error-correcting codes whose ca- 
pacities exceeds that of quantum random coding (hashing) for very noisy 
channels. These codes provide non-zero capacity in a depolarizing channel 
for fidelity parameters / when / > .80944. Random coding has non-zero ca- 
pacity only for / > .81071; by analogy to the classical Shannon coding limit, 
this value had previously been conjectured to be a lower bound. We use the 
method introduced by Shor and Smolin of concatenating a non-random (cat) 
code within a random code to obtain good codes. The cat code with block 
size five is shown to be optimal for single concatenation. The best known 
multiple-concatenated code we found has a block size of 25. We derive a gen- 
eral relation between the capacity attainable by these concatenation schemes 

and the coherent information of the inner code states. 
03.65.Bz, 89.80.+h, 89.70,+c 
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I. INTRODUCTION 



It still comes as a surprise to many physicists that the error correction techniques that 
we now know to exist for quantum states are basically digital and not analog. Even after 
the discovery of quantum error correcting codes [ffj, it was felt that the analog metaphor 
must be more appropriate; after all, a quantum state is specified by a continuous set of 
complex numbers, and the fundamental physical process which we consider as "noise" on 
the quantum state, unitary transformations involving the state and its environment, also 
are drawn from a continuous and not a discrete set. Nevertheless, the entangled structure of 
quantum states, which has no analogy in classical mechanics, permits an essentially digital 
treatment of errors. 

In fact, quantum error correction as we presently understand it |Ï HÏ8fl is required to be 
oblivious to the continuous nature of the quantum state: error correction is accomplished 
by using a coded subspace such that the effect of the errors and error-correction scheme 
are both independent of the direction of the state vector in that subspace. Furthermore, 
these error- correcting actions are not continuous, but are drawn from a discrete set. This 
is related to the fact that the continuous action of the environment can also be "digitized," 
in the following sense: quite generally, noise on a quantum state can be described as a 
transformation which takes a pure quantum state \l/ to a mixed state p given as an ensemble 
of pure states {Aj\I/}. Each of the set of operators Ai can be written as a linear combination 
of some fixed operator basis, Ai = J2j a ijEj. The fixed set of operators Ej are the "error 
operators" of the quantum channel. There form a finite set with d 2 elements, where d is the 
dimension of the Hilbert space of For qubits (d = 2) it has become conventional to use 
the error basis Ej = I (identity), a x , a y , o z (the Pauli matrices). 

These four operators have a very "digital" interpretation. In the simplest memoryless 
noisy bitwise channel, the "binary symmetric channel", each bit is either left alone or is 
flipped. This corresponds to the actions of the operators I and a x on the z-basis qubit 
states. In the quantum case there are just two other actions; we refer to the action of a z 
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as a "phase flip", and that of a y as "both a bit flip and a phase flip." An alternative point 
of view is to represent the quantum state as a two-bit object (cf. PJI5fl), or an object in a 
four-element field GF(4) |nj, so that the four operators are just the four possible digital 
noise actions on them. The only "analog" features left in the description of the quantum 
channel are the continuous amplitudes a^; but these play a very similar role in the quantum 
noisy channel to the bit-flip probabilities in the classical digital channel. 

In this paper we will concern ourselves with the quantum capacity of a simple qubit 
channel, the depolarizing channel. This channel is completely characterized by one fidelity 
parameter /; with probability / the qubit passes through the channel undisturbed (A\ = 
\ffl), while with equal probability g — (1 — f )/3 the qubit is subjected to a rotation by one 
of the three Pauli matrix operators <r x ,y,z (^2,3,4 = \fg^x,y,z)- 

Defining the quantum capacity requires a discussion of the quantum error correction 
codes mentioned above. This discussion will be given in detail in Sec. II. Suffice it to 
say now that many quantum codes [n, k, d] are now known p|j3|J^,[^-|9|,pT|-p!5| , in which an 
arbitrary state of k qubits are coded into a state of n > k qubits in such a way that if no 
more than t = \_d/2\ of the n qubits are subjected to an error, the original fc-qubit state can 
nevertheless be perfectly recovered. The rate of this code is r = k/n. 

With this, the quantum capacity Q{x) of a quantum channel x can De defined: Q{x) is 
the maximum number Q such that for any rate R < Q and any 5 > there exists a quantum 
code C with rate k/n > R such that after the action of x an Y state i/j encoded by C can be 
recovered with fidelity at least 1 — S at the receiving end of the channel P, |ï7| , [i8|l . 

Naively one might expect there to be a relationship between the achievable capacity Q 
for a depolarizing channel of fidelity / and the rate r of a code in which 1 — - ~ /, since 
1 — / is the expected fraction of qubits on which errors will occur. In fact, there is no direct 
relationship because the defmition of capacity does not require that all errors of weight less 
than t are correctable, but only that the fraction of uncorrectable errors vanishes for large 



block sizes. For example, Rains has shown [B5 that all families of codes for which 1 — - < 
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have vanishing rate r; nevertheless, Q is known to be greater than zero for a range of / < |. 
Indeed, in our previous work on "one-way hashing," p],|3| we identified a method for which 
a non-zero capacity could be attained down to about / = .81 POfl . The capacity attained 
has the form of one minus a von-Neumann entropy (see Eq. (|TI|)). This quantum expression 
bears a close resemblance to the result of classical information theory, where the maximum 
information reliably transferable though a noisy channel is limited by the Shannon bound 

H 

C<1-H( X ) (1) 

where H(x) is the average entropy introduced in a bit by classical channel x- 

In the classical problem the Shannon bound is achieved by a random coding procedure; 
the one-way hashing protocol which we invented is the natural quantum analog of random 
coding. Thus, it was natural to expect that Eq. (|TT|) would also be the upper bound on the 
quantum capacity for the depolarizing channel. 

However, the quantum coding problem has not proved to be exactly parallel to the clas- 
sical one; recent work has identified several important properties (the pipelining inequality, 
and subadditivity of mutual information) which are true for classical capacity measures, 
but are not for the quantum version. More concretely, it was recognized early that quan- 
tum error correcting codes can have a property referred to as "degeneracy" which is not 
permitted in the quantum case: two different errors may be indistinguishable by their error 
syndromes, but may nevertheless be both correctable (see Sec. |VÏ| ). Spurred by intuitive 
ideas of how this degeneracy might improve the capacity of the quantum channel, Shor and 
Smolin |TJJ explored some non-random coding strategies, and found a range of depolarizing 
channels (very noisy ones) for which the obvious analog of the Shannon bound is violated; 
a higher capacity is attained than for random codes. 

The main point of this paper is to present the Shor-Smolin discovery using the more 
modern and streamlined tools for describing quantum coding which have been developed 
recently. We will formulate the capacity calculation in terms of the orthogonal-group for- 
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malism [Ï2],[Ï3| which has proved very successful in systematizing almost all known quantum 
codes (the additive codes) J23J. We identify new quantum weight enumerators (cf. [|T6|j), 
for the stabilizer cosets, with which a compact expression for the capacity can be given. 
We show that the Shor-Smolin codes can be understood in the language of code concatena- 
tion which has been very popular for the discussion of fault-tolerant quantum computation 
|26| -j30|. We establish a rather general relation between concatenated-code capacities and 
the quantum coherent information [||J|, a quantity believed to be (but not proved to be in 
general) the quantum capacity |3Ï|]22|1 . Finally, we show that, using the coset-enumerator 
formulation, closed-form expressions can be derived for the capacities of the original Shor- 
Smolin protocols, which permit us to perform a more extensive quantitative exploration of 
the performance of these codes. We do not yet know what the actual attainable capacity of 
the depolarizing channel is, but hopefully the techniques explored here may provide a clue 
of how to obtain this result. 

This paper is organized as follows: Sec. [TT] briefly reviews the orthogonal-geometry group 
theory which has been introduced for the classification of quantum codes, and introduces 
the necessary coset weight polynomials. Sec. |TTT| derives the average-entropy expression 
for concatenated codes as originally obtained by Shor and Smolin. Sec. [TV] shows that 



the capacity attained by the concatenation procedure is equal to the quantum coherent 
information. Sec. |V| presents the compact expressions which we have obtained for the 
concatenation using the "cat" code, which give capacities exceeding the random-coding 
bound. Sec. |V| presents our conclusions, and some thoughts about the use of degenerades 
to attain improved capacities using concatenated codes. 

II. GROUP-THEORETIC CHARACTERIZATION OF CODES 

We consider the group E introduced in [K| which describes all possible Standard errors 
on n uses of the channel (described by products of Pauli-matrix operators on a set of n 
qubits). The bar indicates that the group is understood to be taken modulo phases ±1, 
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±i. The dimension of É is 2 2n . As JÏ3| showed, the Abelian subgroups of E play a central 
role in the theory of quantum error correcting codes. Consider such an Abelian subgroup 
S; again, we will work only with S, from which phases have been removed. We define k < n 
by specifying that S has n — k generators (and thus is of dimension 2 n ~ k ); then any of the 
2 fc -dimensional eigenspaces (denoted Cj, < i < 2 k ) of this set of Pauli-matrix operators 
forms a quantum code. We now introduce S ,_L and S 1 - where S 1 - consists of all elements in 
E which commute with all the elements of S and S 1 - is S ± modulo phases. The dimension 
of the set S 1 is 2 n+k . To analyze the error-correction capability of the code, we define the 
weight of an operator e in E, wt(e), as the number of Pauli-matrix operators (either a x , a y , 
or a z ) appearing in e. If the minimal-weight element of S ,_L \S' (i.e., the set S excluding 
the elements in S) has weight d, then the correct state can be restored after d — 1 erasures, 
which means also that it can correct arbitrary errors on any t = \_d/2\ qubits. d is referred 
to as the "distance" of the quantum code, and the notation for the code is [n, k, d]. 

All these facts have been discussed previously for quantum codes; but for the present 
purposes we need to introduce some additional mathematical objects, the cosets of S in 
È. To understand why these cosets might be natural objects to consider for quantum error 
correcting codes, we recali that the elements of S are the stabilizers of the code; this means 
that if the error suffered by the set of n qubits is any member of S, then the code state is 
unaffected. Note that in the same way any element of the coset s a S acts identically on the 
code; all such elements act as if just the error s a had occurred. For this reason, these cosets 
will play a central role in the analysis below. Note that because É is Abelian (N.B. E is not 
Abelian) no distinction need be made between left and right cosets. 

We will need to consider three different coset partitionings: 1) The cosets of S ± in 
E. Consider the "transversal" of E, the set G = {a} C É which generates the coset 
decomposition of S in É. That is, e Q 5 ,_L f| épS ± = if a ^ (3, and \J a e a S ± = É. The 
dimension of G is 2 n ~ k . We indicate the ath coset of the coset decomposition as e a S ± . 2) 
The cosets of S in S- 1 -. In this case we denote the set generating all the distinct cosets as 
G 1 - = {Sq} C E. The dimension of this set is 2 2k . A typical coset is indicated as s^S. 3) 



The cosets of S in E. This is just the direct pro duet of 1) and 2), the generating set is 
G (g> G- 1 -, and the (a,/3) coset is denoted é a SpS. This hierarchy of coset decompositions is 
indicated in Fig. [I] for k = 1. 

We introduce a weight-enumerator polynomial P for set C as: 

P(C) = ^/"-wt(e)^wt(e) ; ( 2 ) 
eGC 

where C C É. 

We note that this weight polynomial is directly related to the Shor-Laflamme weight 



enumerator ||16|| . Their weight function A d is (we use the normalization choice of Rains |p2| ) 



A d (O u 2 )= J2 ti&OxMe'Oz). (3) 

e'EE 
wt(e') = d 

and their weight-enumerator polynomial is 

n 

A(z,0 1 ,0 2 ) = Y / MOi,0 2 )z d . (4) 
To relate these to Eq. (0), note that 

A d (e, e) = 2 2n 5 wt (e),d, (5) 

from which we see that 

So our new polynomials are simple functions of those which have been introduced in previous 
work in quantum error correcting codes. 

Our weight polynomial P{C) has particular significance for the depolarizing channel with 
fidelity f when C is the one of the various cosets which we introduced above: 

• P{S) is the probability that the coded quantum state will leave the depolarizing chan- 
nel without error. Each distinct operator e G S is an action of the channel which has 
this property, and the probability of that action. 
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P(s^S) gives the probability that the coded state leaving the quantum channel is 
detected to have no error, but has actually been rotated inside the code eigenspace by 
s^- This is so from the definition of s^: since it commutes with all elements of S, 
it does not change the eigenvalues of S which are detected in the channel-decoding 



operation pTfl ; and, since by definition ^ S, it performs a non-identity rotation of 
the coded state inside the code eigenspace. In fact, every member of the coset s^S 
performs the same rotation. 

• P(5 ,_L ) gives the probability that no error will be detected upon decoding, regardless 
of whether the final quantum state is correct or not. 

• P(e a S'- L ) is the probability that decoding detects error è a , regardless of the rotation 
of the coded state. 

• P(e/3S^S) is the probability that decoding detects error ep, and the coded state is 
rotated by éps^- 

These will be the essential tools for developing a compact formula for the attainable capacity 
for code states, and establishing the identity of the coherent information with the Shor- 
Smolin quantum channel capacity. 

III. SHOR-SMOLIN CONCATENATION PROCEDURE 



In order to formulate the main result, we first review the Shor-Smolin procedure JT4 
for sending reliable qubit states, with a finite capacity, over a depolarizing channel. Just 
as in conventional channel coding, it involves an additive code specified above by S. In 
conventional channel coding shown in Fig. |2|, the additive code is used as follows: the state 
|£) to be transmitted (we specialize in the figure to a single qubit state) is rotated by the 
encoding unitary transformation 8 into the eigenspace Cq of the operators in S. When this 
state passes through the depolarizing channel, it is rotated into one of the other eigenspaces 
C m with some probability: we will analyze this process in detail later. Then after passage 
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through the noisy channel, the decoding transformation D places n — k of the qubits (the 
lower n — 1 in the figure) in a state such that, when they are measured in the Standard 
basis, they give the eigenvalue of each of the n — k generators of S, that is, it determines 



which of the spaces C m the state had been placed into by the noise []27J. So long as the 
errors produced by the channel are restricted to have weight no greater than \_d/2\, then a 
rotation U can always be determined which restores the state to its noiseless form 

The discussion below uses another protocol for coding shown in Fig. |||, the purification 
protocol of For the depolarizing channel the two procedures of Figs. ^| and ||| are 

completely equivalent. Ref. || gives a detailed derivation of the mapping of the first protocol 
to the second. In the protocol of Fig. |3], the sender begins with n completely entangled states, 
in this example the Bell state 

$ + = -^(|00> + |ll». (7) 

The sender keeps one half of each of the n Bell states, and the other n partides are sent 
through the depolarizing channel to the receiver. When we are sending halves of EPR-Bell 
partides through the channel, we no longer discuss the action of the channel in terms of 
rotations among different code spaces (no coding transformation has yet been applied to 
these states); rather, the state of the system at slice XI in Fig. |3| (which in general is at two 
different times for the two different sets of partides) is one in which the set of Bell states 
has been rotated to a set of some of the other Bell states with various probabilities which 
we will discuss shortly. The full set of Bell states is 

$ ± = -^(|00)±|ll)), (8) 
* ± = -^(|01>±|10». (9) 

The probability of a particular set of n Bell states at slice XI is determined by the rule that 
the Bell state remains a <J> + with probability /, and becomes one of the three other states 
^ ± with probability g = (1 - f)/3. 

Using the decoding transformations T> and T>* in Fig. |3], followed by measurements on 
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both ends, classical communication from the sender to the receiver, and the final unitary 
transformation U, the sender and receiver can come into possession of a "purified" <£> + pair, 
which is then used to send the qubit state |£) by teleportation |33|] (for details see ||). 

The two methods of employing the channel shown in Figs. |2| and |3] are completely equiv- 
alent. But it will be useful to use both points of view for explaining the generalized channel 



transmission protocol of Shor and Smolin ||14|| , and we will continue our review using both 
languages. 

We will need to apply our capacity definition of Sec. | to the purification picture. The 
fidelity F v of the depolarizing channel output can be most simply be defined in this picture 
in the following way: at the end of purification (slice XP in Fig. |3]) the output is desired 
to be a collection of k <£> + states; if the code scheme is a successful one, then the overlap 
between the actual state at this slice pxp and the desired Bell state will be high; thus the 
fidelity for an encoding T> is 

F v = (($+) fc | PxP |($+) fc >. (10) 

The capacity Q is simply the best rate k/n for a T> for which this fidelity approaches unity, 
since each high-fidelity EPR pair can be used to teleport one qubit. 

The maximization of Q has proved to be difficult. But a variety of code families have been 
introduced for which finite Qs are known, establishing useful lower bounds on the attainable 
capacity. One of the most useful is the sequence of random additive codes, referred to in 
the original papers [QH as "one- way hashing." As the name suggests, these sequences are 
built by selecting, at random, an Abelian subgroup S from the group of all Pauli matrices E 
for successively larger block sizes n. Bennett et al. show that almost all such sequences 
attain the "hashing capacity" 

Qh= hm - = 1-S w (f). (11) 

n— >oo n 

Thus, Ql·l(f) is a lower bound on the attainable capacity. Sw(f) is the von-Neumann 
entropy of one Bell state after one of its partides has been passed through the depolarizing 
channel, and it is given by 

10 



S w (f) = -f\ogf-3g\ogg. (12) 

Here is a brief explanation of why one-way hashing achieves the capacity of Eq. (|TT|). 
The entropy of the mixture of Bell states at slice XI is just nSw(f)- The decoding can 
be simply thought of as a sequence of measurements of the n — k operators which are the 
generators of S. Each of these measurements has two outcomes, splitting the set of possible 
remaining states in two; thus, it has the potential for reducing the entropy of the state by 
one bit. Ref. fU provides arguments for why, for almost all choices of S and for large n, 
each measurement in fact succeeds in extracting one bit of entropy The total state remains 
a mixture of Bell states, so that if k is chosen so that the entropy is reduced to zero, i.e., if 
nS(W) — (n — k) = 0, then the Bell mixture becomes a pure state, which is to say that the 
final state is one particular set of known Bell states, which can always be rotated with U to 
become a set of <£> + states. Thus, purification has succeeded, and the ratio k/n attains the 
value given in Eq. (|TT|). 

This result naturally raises the question of whether there exist any non-randomly chosen 
sequences of codes which could attain a capacity exceeding Eq. ([□]). While appeal to anal- 
ogous classical results and other thinking suggested that random coding would be optimal, 
the Shor-Smolin construction which we now review shows that higher capacities are attain- 
able. Their construction involves what is known as concatenation; it is illustrated, for both 
versions of the quantum coding protocols, in Figs. |] and [|. In the language of Fig. [|, the 
idea is that instead of sending the qubits as encoded by the random encoder £ directly into 
the channel, they are encoded once again in another additive code [p, k,d], and it is these 
n x p qubits that are finally sent through the channel. The codes whose capacity we will 
consider involve n — > oo, but fixed p. While we tend to associate "good" (i.e., high-capacity) 
codes with large distance d, we will find that the desirable inner [p, k, d] codes actually have 
a small distance d. As we discuss at the end, it may be the "degeneracy" of this code which 
is relevant. 

Shor and Smolin showed [111] that the following capacity is attainable by this concate- 
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nated scheme: 

Qss = ^(l-S X2 ). (13) 

The 1/p just comes from the fact that the whole scheme requires p x n bits rather than just 
n bits to be sent through the channel. Sx2 is the average entropy of each bipartite state at 
slice X2 in Fig. (the total entropy at slice X2 is nSx2)- Shor and Smolin noted that this 
entropy is not given by the von-Neumann entropy of the quantum state at this slice, because 
of the presence of the results of the classical measurements. Rather it is the average of the 
von-Neumann entropies of the quantum states conditional on the measurement outcomes: 

S X 2= E Pr«W)= £ Pr(i)/u({Pr(Bi|i)}). (14) 

iÇmcas. iÇmcas. 
outcomes outcomes 

It is this entropy that is to be reduced to zero by the random-hashing stage of the decoding. 
In the second part of Eq. (|Ï4|) we have specialized to the case where the inner code has k = 1 
(and thus produces just one qubit-pair state in Fig. ||). In this case the output is a mixture 
of the four Bell states {Bj} = $ ± , ty^, so that the entropy just involves the probability of 
Bell state Bj conditional on the particular measurement outeome i: 

Pr(Í ^ = "P§p Pr W = EPr(^,0· (15) 
The h A function in Eq. (|HD on the set {%i} is defined by 

n n 

hn({Xj}) =-J2 X 3 l °&2 Bj, E X 3 = l - ( 16 ) 

i=l 3=1 

By using the elementary algebraic properties of the h n function Sx2 may be simplified so 
that Qss is expressed as 

Qss = ~[1 + h N ({Pr(i)}) - h iN ({P<B vï )})]. (17) 

Here N is the number of distinct measurement outcomes; for an additive [p, k — 1, d] code, 
N = 2 p ~ k = 

The probabilities appearing in Eq. (|Ï7|) have appeared above; in fact they are equal to 
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Pr(z) =P{e i S ± ), (18) 

Pr(S i ,z) = i 3 (sj-e i 5). (19) 

Eq. (|T8| ) follows from the fact that the members of the set S 1 - are, by definition, those errors 
which all lead to the measurement which indicates the "no-error" condition; thus, its cosets 
in É, éiS- 1 , each contain the errors which all lead to the same measurement i. Finally, the 
weight polynomials are, as discussed above, constructed so as to enumerate properly the 
probabilities of these sets. Eq. (|Ï9l) follows similarly: The set S indicates those errors which 
lead to the "no-error" measurement and leave the Bell state B^ in the correct <£> + state. 
Furthermore, the coset sféiS contains those errors which lead to measurement i and Bell 
state Bj. It should be noted that the error operations sj- have the effect of performing a 
unitary operation of the coded qubit; the four operations are 1) Sq = I (the identity), which 
leaves the Bell state <3> + unaffected, 2) s^, which performs a coded a x , leading to a final Bell 
state B x = 3) Sy which performs a y and leads to B y = and 4) which performs 
a z and leads to B z = $~. So, the weight polynomial in Eq. fll^D is constructed to evaluate 
the probability that a member of the coset occurs. 
Finally we may rewrite the capacity equation as 

Qss = -[1 + hjfüPieiS 1 -)}) - h m ({P{sj-éiS)})]. (20) 

IV. RELATION OF Q SS TO QUANTUM COHERENT INFORMATION 

The two noisy-channel transmission constructions which we have discussed above are 
equivalent to yet a third one shown in Fig. |6], which has been extensively discussed in the 
literature p,^,^]. The rationale of introducing the one-qubit ancillary system 'i?' is that it 
is the minimum-size ancilla required to "purify" the input of the channel, that is, to make it 
part of a larger pure state [Q (this is a different sense of the word "purification" than used 
in [Q]). In this scenario there is an important information-theoretic measure, the coherent 
informatíon; at slice X3 this is given by the difference of two von Neumann entropies: 
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h= l - (S( PQ ) - S( PRQ )) . (21) 



Refs. p2| , |35| show that I e provides an upper bound for the quantum channel capacity when 



maximized over all possible input-state ensembles and quantum codes. What we will show 
is that the achievable Shor-Smolin capacity Qss hi fact attains the coherent information for 
the same additive quantum code, and for the input as in Fig. |[ To establish this we need 
to show the following two equalities: 

S( PQ ) = l + h N ({P(e l S ± )}) } (22) 

and 

S(prq) = hwdPisfeiS)}). (23) 

Establishing these just requires a consideration of how the noise acts on the input state in 
Fig. |6|. For Eq. (^), we note that the density matrix pç before the action of the noise is just 
an equal mixture of the \0)lq and |1)lo states, where the subscript indicates that these 
vectors lie in the eigenspace Cq. Each eigenspace Cj, < i < 2 n ~ k , is spanned by a pair of 
vectors \0)lí, where we can define the and 1 vectors by 

|0)w = ei|0>£o, |l) Li = e i |l)io, (24) 

where is the coset-generating operator (see Fig. [[]). The importance of the basis |0, 1)lí is 
that the density operator pg after the action of the depolarizing noise is diagonal in it. The 
diagonal matrix elements (i.e., the probabilities) for each vector is evaluated by noting that 
the state \0}lí is reached in four possible ways: 1) the initial state is \0)l (with probability 
|) and an operator of the coset èiS is applied by the channel, 2) the initial state is |0)lo and 
an operator of the coset éisjrS is applied by the channel, 3) the initial state is |1)lo (also 
with probability |) and an operator of the coset Cisj^S is applied by the channel, or 4) the 
initial state is |1)lo an d an operator of the coset èiSyS is applied by the channel. Each of 
these is given by the appropriate weight polynomial, so 
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(0\p Q \0) Lï = ~P{eiS) + \P{ei8^S) + \P{e^S) + §P(e^S) (25) 
= \P{eiS L ). (26) 

The enumeration of the ways that the state \1)lí can be arrived at is identical, with Os and 
ls interchanged; so we find that this matrix element is identical: 

(l|p g |l> ií = (0|pg|0) Li . (27) 

Because it is diagonal, the von Neumann entropy of pg is just the ordinary entropy of the 
probability distribution: 

S(p Q ) = h 2N ({\P(e t S L ), \P{eiS^)}) = 1 + ^({P^)}). (28) 

And thus Eq. ( ^2|) is established. The reasoning needed to establish Eq. (|23|) is very similar: 
the joint state of systems R and Q after encoding but before the noise is 

\\^r)\^q)w + \\Ir)\Iq)lo = ^- (29) 

In this notation the i in means that the state in the Q subsystem lies in the C« eigenspace. 
After the noise the density matrix prq is diagonal in this generalized Bell basis, with the 
probability of the state being Bji given by 

(BjilpuQlBji) = P(èisfS), (30) 

since it is again only members of a particular coset that will produce a final Bji state. (This 
discussion can equivalently be given in terms of the behavior of the [p + 1, k = 0,d] code 
to which the composite system belongs.) From Eq (|3H|), the desired result Eq. fl2"3|) follows 
immediately, so the identity between the Shor-Smolin capacity and the coherent information 
is established for any code. 



V. Qss FOR THE "CAT" CODE 
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A. Closed-form evaluation 



It has not proved easy to evaluate the Shor-Smolin capacity Eq. (|20|) (or the equivalent 
coherent information) for a general concatenation. But a closed-form evaluation has proved 
possible for one important family of inner [p, 1, d] codes which we refer to as "cat" codes. In 
the cat code for p > 2 the stabilizer group S is generated by the operators 

Ozio"z2, &zi&z3, a z ia zp . (31) 
For this code the code space C is spanned by 

p qubits 

|0) L0 = | 00077?> (32) 

and 

|l>xo = |Hl...)· ( 33 ) 

Thus, the source density matrix before passage through the channel is, using the Schumacher- 
Nielsen notation (Fig. ||) ||, 

PQdn) = ||000...><000...| + ||111..-><111...|. (34) 
A purification of this density matrix involving just one qubit in the subsystem R is 

p qubits p qubits 

* RQ = -L(IOOÓÒ^) + |lTTÏT)). (35) 

Here the first qubit is the one belonging to system R. This wavefunction has been referred 
to as the "cat state" in the literature. 

The decoding network T> for this code is extremely simple, just consisting of the sequence 
of XOR gates shown in Fig. ^. Shor and Smolin provide a detailed argument [Ï4j for counting 



all the probabilities in Eq. fll~4]) by determining how each different type of error process is 
modified by the XOR circuit. We summarize their results here: consider counting the 
probabilities of the cases (including all members of one of the cosets of 5 ,_L ) in which the 
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measurements give exactly r ls, in particular when the measurements of qubits 2 through 
p — r give zero, and qubits p — r + 1 through p give one. It is obvious that the counting is the 
same for any permutation of the qubits; this means that there are ( P 7" L ) equivalent cosets 
being counted. It is this high multiplicity that permits the calculation to be tractable, 
despite the fact that there are exponentially many (in p) coset weight polynomials to be 
evaluated. 

The further four subcases (i.e., the cosets of S; see Fig. [!]) to be evaluated are: 

1) The remaining qubits (qubit 1 of Q and the qubit of R) are in the state The error 
processes for which this occurs are those where there are amplitude (a x ) errors on qubits 
1 through p — r, and an even number of phase (a z ) errors on any of the qubits. We may 
forthwith calculate the probability of this occurrence: 

p^vh E Y,( p ~ r )(/_ )<r r+t - i r- m = 2 p - r - 1 sr r (f + 9) r · (36) 

t (even) i \ 1 / V V 

Here t is the total number of phase errors and i is the number of these phase errors occurring 
on the qubits which already have amplitude errors (leading to a o y error process). The t 
and i sums go over the full range for which the binomial coemcients are non-zero. 

2) The remaining state is For this the error processes are those where there are 
amplitude (a x ) errors on qubits 1 through p — r, and an odd number of phase (a z ) errors on 
any of the qubits. In fact, it turns out that this count is exactly the same as for 

Pr(*-,r)= J2 E( P ~ r ){/_)9 P ' r+t ' Í f r ' t+i = ^' r ~ 1 9 P ' r (f + gY- (37) 
t (odd) i \ 1 ) V V 

3) The remaining state is <£> + . In this case there must be amplitude errors on qubits 
p — r + 1 to p (or no amplitude errors if r = 0), and there must be an even number of phase 
errors. This gives 

Pr($V)= ^ E (ryp-r\ g „ t - if - r -, +l JW + 9r + (f-m, r = 0, 

t (even) t W \ t V 1 + g)P~ r , T > 0. 

(38) 
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4) The remaining state is $~. In this case there must be amplitude errors on qubits 
p — r + 1 to p, and there must be an odd number of phase errors. The result is the same as 
for <3> + except for the r = case: 

Pr($ -, r)= E ^(AíP-ry^^Jw^y-if-^), r=o, (39) 

t (odd) i W V* v I 2 r - V(/ + g) p ~ r , r >°- 

Plugging these expressions into Eqs. (|Ï5l - |ï7l) permits an efíicient calculation of the Shor- 



Smolin capacity for the family of cat codes. 

The threshold of the cat-code family may be computed exactly for p —>■ oo using an 
asymptotic analysis. Briefly, we find that the capacity Eq. ( p0|) is dominated for large p by 
two contributions: 1) Those for the cosets of S with r = (recali that r is the number of 
ones in the measured syndrome). We find that this contribution goes like 

Qss(r = 0) = c(^ r ^pj P . (40) 

2) Those for cosets with r ~ |. This contribution has the form 

Qss(r « |) = - 7 (/) ^8g(f + g)y . (41) 

Here 7(/) > is a fairly complicated function of /. Nevertheless, the threshold for / 
specified by Qss(f) — is simply obtained by equation the bases of these two contributions: 



(f-g) 



2 



\j8g(f + g)- (42) 



f + g 

The relevant root of this equation, / w .81808, is the asymptotic threshold. We have not 
developed any simple intuitive understanding for why this threshold should remain finite as 
p — ► oo, but nevertheless remain worse than the threshold for finite p as we will now see. 



B. Investigations of cat-code capacities 



The simplest codes to calculate are the cat codes Eq. (^,^). The table shows the 
results for vàlues of p from one to fourteen. The capacities Qss of these codes near / = .81 
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are shown in Fig. |8|. We note that odd-p codes work better than nearby even-p codes; the 
lowest threshold fidelity in this family is achieved for p — 5. 

Generically, many other multiple-concatenation codes are possible and may lead to better 
thresholds. We explored the family of codes where the innermost code has a rotated cat 
code for which the stabilizers are: 

Vxi&x2, cr xl a x3 , ... (43) 

and the next-level code remains the ordinary cat code of Eq. (0). The best code we found 
was for both inner and outer cat-codes having p = 5. The capacity of this code was found 
to be non-zero down to a fidelity of / ~ .80944, the best code known |36|]. This threshold is 
still far above the best known lower bound for the threshold of / = | [0,[§ . Unfortunately, 



larger codes become còmput ationally intractable using our methods, because the number 
of distinct cosets scales exponentially with p. It is hoped that another approach, perhaps 
an approximation method for coset weights, will permit a more thorough exploration of 
concatenated codes. 

VI. CONCLUSIONS 

The obvious unanswered question which this work raises is, can any finite capacity be 
achieved for even noisier depolarizing channels, ones with / below the lowest value, .80944, 
achievable with the 25-bit inner code, but above the absolute minimum threshold / = 0.75 



set by the no-cloning argument [[37]]? In other words, do there exist even more clever non- 
random codes (recali [[23[]) for protecting qubits from high levels of noise? 

It may be worthwhile to note here why we initially believed that the use of inner codes 
of the cat type was a promising direction for finding good codes for very noisy channels; this 
belief was based on the property of degeneracy mentioned earlier. While these motivations 
may end up having no more than historical interest, since they have not at present led us to 
any conclusive answer to the qüestions just posed, we hope that it might assist some reader 
who is interested in exploring these problems further. 
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Degeneracy is a property of quantum codes which has no analog for classical error cor- 
recting codes. Degeneracy arises from the fact that two different error patterns can have 
indistinguishable effects on a coded quantum state. This is obviously impossible for a coded 
binary (classical) string, but it is obligatory for additive quantum codes; indeed, the cosets 
of S introduced in Sec. [TJ are precisely these groups of indistinguishable errors. A code is 
considered degenerate if some of the low-weight (< \_d/2\ for an [n, k, d] code) error patterns 
fall in the same coset of S and are therefore indistinguishable. The original 9-bit code of 



Shor [[y] was degenerate; the 7-bit code |0-§| and the 5-bit code |lï|,|3fl are non-degenerate. 



It is known |ï^] that a Hamming-like bound could be easily derived on the maximum 



attainable distance for a quantum code, provided that it was non-degenerate. However, 
in this work the possibility remained open that degenerate codes could attain a greater 
distance. We were thus motivated to consider highly degenerate codes for the attainment 
of high capacity, given the qualitative relationship between code distance and capacity. 
This possibility of attaining large distance using degeneracy has subsequently been rendered 
unlikely by a recent result of Rains [ÏS| who has obtained a bound on d which applies for 



both degenerate and non-degenerate codes and which is tighter than the Hamming bound 
for a substantial part of the p, k, d parameter space. Nevertheless the fact is that the cat 
codes, which we have used successfully to attain high capacity, are highly degenerate: single 
phase errors are all indistinguishable, and all pairs of amplitude errors are indistinguishable 
from the no-error process. All this is true despite the fact that the cat codes have very poor 
distance (d = 1 for all p). 

The best we can say about why this scheme succeeded is that the high degeneracy, by 
making many outcomes indistinguishable, "hides" the large amount of entropy which the 
very noisy channel adds to the quantum states, thus permitting the average entropy Sx2 to 
be below one over a greater range of /. This reasoning is certainly not rigorous; nevertheless, 
in an extensive Monte-Carlo search of other additive codes, we found no other inner code 
with p < 5 which does a better job than the cat code for reducing the average entropy 
and hence attaining any higher capacity. It was further the observation that the cat code 
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"hides" phase error more effectively than amplitude error that motivated us to consider a 
second level of concatenation, in which the innermost code was a cat code with the role of 
amplitude and phase reversed. Of course, this is what led us to the 25-bit code described 
above which give the best capacity to date. 

It is clear that further generalizations of this problem await exploration. The issue of 
attainable capacities for channels other than the depolarizing channel is largely untouched. 
It is fairly clear that for the generalized depolarizing channel, in which the error opera- 
tors are still proportional to the Pauli matrices, but with unequal probability amplitudes, 
the formalism developed here (i.e., the weight polynomials, and the relation to coherent 
information) will go through with little modification, so that would be an easy direction 
for further study. For the much larger space of general channels, nothing better than our 
"twirling" arguments of [[| (which bounds the capacity of any arbitrary channel by that of 
a corresponding generalized depolarizing channel) is presently known. Further extensions of 
the formalism would obviously also be desirable; a generalization of the present approach 
for inner codes with k > 1 would be desirable; also, asymptotic expressions for the capacity 
which would not require an exact evaluation of all the coset weight polynomials could lead 
to significant progress. Certainly there remains much to be done to fully characterize the 
usefulness of the very noisy quantum channel. 

We thank Charles H. Bennett, Artur Ekert, Daniel Gottesman, Emmanuel Knill, John 
Preskill and Eric Rains for many helpful discussions. We are grateful to the Army Research 
Office for support. 
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TABLES 



p 


f 


P 


f 


1 


.81071 


9 


.81002 


2 


.81148 


10 


.81028 


3 


.80987 


11 


.81032 


4 


.81010 


12 


.81056 


5 


.80964 Best 


13 


.81062 


6 


.80991 


14 


.81085 


7 


.80977 






8 


.81004 


oo 


.81808 



TABLE I. The value of the threshold fidelity / for cat codes of size p. Vàlues of p not shown 
all work less well than the random coding method (p = 1). The value for p = oo is analytic from 
Eq. ü). 
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FIG. 1. Hierarchical partitioning of the set E into cosets of S ± , and those in turn into cosets 
of S. The case of S ± dividing into four cosets is special to the case of coding a single qubit. 
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FIG. 2. Quantum channel coding, in which (top) the state to be transmitted |£) is encoded 
by £, transmitted through the noisy channel, decoded by T> and restored by U after syndrome 
measurement. The entire encode-transmit-decode process can be through of a module (double 
box, below) to be used in concatenation (see Fig. 01). 
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FIG. 3. A protocol for transmitting through the noisy channel equivalent to Fig. || which uses 
entanglement purification and teleportation. The sender passes halves of Bell states ( < £ >+ ) through 
the channel to the receiver; the degraded pairs are purified (after slice XI) by T> and T>* (same 
T> as in Fig. ||). The purified pairs at slice XP can then be used to transmit the state |£) from 
sender to receiver by teleportation. Below: the Bell-state distribution and processing with T> and 
V* may be used as a module (double box) for the concatenation of Fig. ||. 
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FIG. 4. Concatented coding for channel transmission. The inner code (double box) is the 
encode-transmit-decode module of Fig. g In the Shor-Smolin procedure the outer part is a 
random code. 
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FIG. 5. Same as Fig. § but using the purification-teleportation protocol. 
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FIG. 6. The Channel protocol as considered by o in its treatment of coherent information. 
The 'Q' subsystem in the one transmitted through the channel, while the single qubit 'R' remains 
behind. 
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FIG. 7. Quantum network for decoding the "cat" code, shown for p = 5. The same network is 
used for encoding. 
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FIG. 8. The yield, i.e., capacity QsSi a s a function of fidelity / for inner cat codes of size p for 
of various vàlues of p. Note that the curves are all in p order from p = 1 to p = 7 along the right 
side of the graph. 
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FIG. 9. The threshold value / for which Qss = vs. p. Note that the points fall on two 
smooth curves, one for even p and one for odd p. The value at p — > oo, / ps .81808, is obtained by 
asymptotic analysis Eq. (fÏ2|). The heavy dashed line at / ~ .80944 is the best known threshold, 
for the twice concatenated 25-bit scheme (Sec. |V A[ ). The light dashed line at / ~ .81071 is the 
threshold for ordinary quantum random coding, equivalent to the p = 1 cat code. 
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